The minimum description length principle for pattern mining: a survey
نویسندگان
چکیده
Abstract Mining patterns is a core task in data analysis and, beyond issues of efficient enumeration, the selection constitutes major challenge. The Minimum Description Length (MDL) principle, model method grounded information theory, has been applied to pattern mining with aim obtain compact high-quality sets patterns. After giving an outline relevant concepts from theory and coding, we review MDL-based methods for different kinds various types data. Finally, open discussion on some regarding these methods.
منابع مشابه
Minimum Description Length Principle
The minimum description length (MDL) principle states that one should prefer the model that yields the shortest description of the data when the complexity of the model itself is also accounted for. MDL provides a versatile approach to statistical modeling. It is applicable to model selection and regularization. Modern versions of MDL lead to robust methods that are well suited for choosing an ...
متن کاملA tutorial introduction to the minimum description length principle
This tutorial provides an overview of and introduction to Rissanen’s Minimum Description Length (MDL) Principle. The first chapter provides a conceptual, entirely non-technical introduction to the subject. It serves as a basis for the technical introduction given in the second chapter, in which all the ideas of the first chapter are made mathematically precise. This tutorial will appear as the ...
متن کاملUsing Evolutionary Programming and Minimum Description Length Principle for Data Mining of Bayesian Networks
We have developed a new approach (MDLEP) to learning Bayesian network structures based on the Minimum Description Length (MDL) principle and Evolutionary Programming (EP). It employs a MDL metric, which is founded on information theory, and integrates a knowledge-guided genetic operator for the optimization in the search process. In contrast, existing techniques based on genetic algorithms (GA)...
متن کاملOptimization Framework with Minimum Description Length Principle for Probabilistic Programming
Application of the Minimum Description Length principle to optimization queries in probabilistic programming was investigated on the example of the C++ probabilistic programming library under development. It was shown that incorporation of this criterion is essential for optimization queries to behave similarly to more common queries performing sampling in accordance with posterior distribution...
متن کاملMinimum Description Length Principle for Linear Mixed Effects Models
The minimum description length (MDL) principle originated from data compression literature and has been considered for deriving statistical model selection procedures. Most of the existing methods that use the MDL principle focus on models with independent data, particularly in the context of linear regression. This paper considers data with repeated measurements and studies the selection of fi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data Mining and Knowledge Discovery
سال: 2022
ISSN: ['1573-756X', '1384-5810']
DOI: https://doi.org/10.1007/s10618-022-00846-z